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Abstract 

The purpose of this paper is to investigate RBF approximation with highly nonuniform 
centers. Recently, DeVore and Ron have developed a notion of the local density of a set of 
centers - a notion that permits precise pointwise error estimates for surface spline approximation. 
We give an equivalent, alternative characterization of local density, one that allows effective 
placement of centers at different resolutions. 



1 Introduction 



This brief article concerns local approximation results for radial basis function (RBF) approximation 
with the goal of effectively placing centers at varying resolutions. We consider RBF approximants 
of the form x i— > Yl^eE 4'ix~C)j where the arrangement of centers, H, may be highly nonuniform. A 
motivation for this set up is that centers may be placed strategically to treat defects in the target 
function. This becomes very important in high dimensions, where conventional quasi-uniform 
placement of centers is extremely costly; error estimates assuming a (small) fill distance h require 
a placement of 0{h~'^) centers; obtaining a comparable error with fewer centers is clearly desirable. 
To this end, we seek a method by which H can be chosen to achieve a pointwise error that reflects 
the local arrangement of H. In turn, this requires finding a useful measure of the local density of H. 

In [3], DeVore and Ron establish powerful local error estimates for kernel based approximation; 
along the way they develop a satisfactory notion of local density - the majorant - expressed as a 
function over a domain containing the centers (see ([2]) below). This function gives, roughly, the 
distance to the nearest unisolvent subset of H. However, it also satisfies an extra condition of global 
compatibility: it may not grow or shrink too rapidly. This condition is not stated explicitly, rather 
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it is contained in the definition, but it plays an essential role in their local error estimates. In this 
article we investigate this compatibility condition and give an equivalent condition that lends itself 
to effective placement of highly nonuniform centers. 

Error estimates in |3j show that kernel approximation bears a strong similarity to univariate spline 
approximation, which exhibits local convergence in the sense that error decays rapidly over intervals 
where breakpoints are tightly spaced. Indeed, such results have long been known for spline quasi- 
interpolation, at least since [1]. If Qt is the quasi-interpolation operator of order r associated with 
knots T = (tj), then [2, (4.18)] in conjunction with a theorem of Whitney [21 Theorem 4.2] tells us 
that for X € [tj,tj+i], 

\f{x) - QT{f){x)\ < CrU;r+lif,tj+r - tj-r+l)- 

We note that the error from spline approximation at x depends only on the distribution of knots 
near x (the r*^ nearest neighbors) and the smoothness of the target function in a neighborhood of 
X. This result can be attributed to the local nature of the B-splines basis (and its associated dual 
functionals). 

Because kernels are often globally supported, and because a truly local basis similar to the B- 
splines seems to be out of the question, the distance to the nearest neighbors may be unsuitable for 
measuring local density (see Remark 3 below for evidence to the contrary). Kernel approximants 
exhibit far field effects, meaning that each kernel has a global influence, and the majorant of 
DeVore-Ron, via the global compatibility condition, penalizes remote, sparse density in an effort 
to mitigate such far field effects. 

In the following section, we present an alternative, equivalent characterization of the majorant of 
Devore and Ron, one that involves a global compatibility condition, and we give a self-contained 
development of their pointwise error estimates that explicitly uses this compatibility condition. In 
Section 3, we discuss how the compatibility condition may be used to place centers nonuniformly. 



2 Local Estimates 

In [3], DeVore and Ron construct a local approximation scheme using a simple measure of the local 
density of centers. This initial notion of density (given in Definition [1]) is not suitable to capture far 
field effects, and the analysis of the scheme's convergence eventually relies on a more refined notion 
of local density: the initial density's majorant. In this section, we recast the DeVore -Ron result 
with a different but equivalent local density parameter, one that lends itself to efficient placement 
of centers. As in [3], we begin by giving the initial definition of local density, a function on which 
at each a G M"^ indicates a radius sufficient to capture a i^-stable local polynomial reproduction of 
order £: 

Definition 1 (Local Density). Given a set of centers H C M'^, a local density (LD) p : M'^ ^ 
is a function with an associated local polynomial reproduction of precision L That is to say, there 
is a kernel a : H x M'^ — >• M : (^, a) i— > a(^, a) so that the following hold: 
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(Support) For — a\> p{a), a{£,,a) = 0. 

(Precision) For all p E 11^ we have a('?, =p{a). 

(Stability) There is K > such that a)\ < K for all a. 

This construction is allows the surface spline, (p (also known as polyharmonic splines because the are 
the fundamental solution of the A;-fold Laplacian A'^), to be approximated by a linear combination 
of nearby shifts. This is accomplished with a small, local error: 



(X — a) 



^a(e,a)<A(x-e)|<Cp(a)2^-'^(l + ^^) \ u:=l + d-2k, 
and one can generate the approximant: 



«6 

This leads to convenient pointwise error estimates: 



\fix)-T^f{x)\<C [ |AV(«)|p(«)''=-'^fl + ^^Vda. (1) 

Observe that the construction of the approximation operator T= depends only on a local poly- 
nomial reproduction (^, a) i-^ a(^,a). The estimate ([T]) holds for any LD with local polynomial 
reproduction a. In particular, it holds for any function p' with p' > p. With this in mind, it may 
seem tempting to use an optimally small p: the LD so that p{a) is the minimal radius around a 
that captures a i('-stable polynomial reproduction of order £. Sadly, because of the global nature 
of the kernels, this is unsuitable for producing precise estimates. 



H{x) = sup p{y) ( 1 + ^-^ ) . (2) 



In [3] the LD is used to create a "majorant": 

sup P J , ^ : , , 

yeRrf V p{y) 

In short, the coefficient kernel associated to the LD is used to create the approximant, but, in order 
to attack the estimate ([1]), an expression involving the majorant controls the error. The initial LD 
only plays an ancillary role: to construct the approximant via the coefficient kernel and to generate 
the majorant. 

To be sure, the entire exercise could be repeated using only H and entirely without p. It is a simple 
task to show that H is itself an LD (since H{x) > p{x) and the local polynomial reproduction 
(^,x) I— > a(^, is a local polynomial reproduction for H as well). Moreover, the majorant of H is 
simply a constant multiple of H (a constant depending only on r). It follows that by replacing the 
initial LD p with its majorant one would obtain the same results. 
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Alternately, from the beginning one may impose the assumption that the LD is equivalent to its 
majorant (i.e., it is self-majorizing) . This is a change of perspective: from the point of view that 
centers have been given outside of our control (and with the goal of remaining faithful to the local 
distribution of data by having error estimates reflecting the local density) to the setting where the 
spacing of centers is chosen to reflect regions of interest or to attack defects in the target function. 
We will choose centers having an LD that is correct from the start, having an extra condition 
designed to handle far field effects. The condition on the LD will be different from that of [3j; 
it is a slightly more easily verified property: slow growth. To proceed, we formalize both extra 
assumptions (slow growth and self-majorization) under the heading of global compatibility and 
take a moment to discuss their equivalence. 

Definition 2 (Global Compatibility). // there is a constant Cgg > so that for every x and a, we 
have ^ 

p{a) < Csgp{x) (l + , (3) 



'sgh 

we say p exhibits 1 — e slow growth. 



p{x) 



The function p : Q ^ M_|_ exhibits self-majorization of order r if there is a constant Cgm > so 
that for every x and y, we have 

p{y)>Cs^p{x){l+'^^^^ ' (4) 



The equivalence of these two assumptions can be expressed formally: 

Lemma 3. // p satisfies the property of self-majorization ^ then it satisfies the property of slow 
growth ^ with e = and constant Cgg depending only on r and Cgm- Likewise, if p satisfies ^ 
then it satisfies 0) with r = and constant Cgm depending only on e and Cgg- 



Proof. When |x — a| > p{x), ([3|) implies that p{a) < 2^ "^CsgPixYlx — so 



On the other hand, when |x — a| < p{x), p{a) < 2^ ^Csgp{x), so p satisfies self-majorization with 



Csg V /'(« 

ci iiaiiu, wiicii \ju — Lt| ^ fjyjbjj l-zy^) ^ ^ 

Csm = min 

When Ix — a| > p{x), ([4|) implies that p{a) > 2"'^CsmP{xy^^\x — a]'"^ , so 

—Pi^r^ f^y>Kx)i+^ 



Csm V 

On the other hand, when |x — a| < p{x), p{a) < 2~^CsmPix), so p satisfies slow growth with 
Cr, = max ( 7?^, (^] ^ ^ 1 . □ 
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Either of these extra assumptions on p are sufficient to obtain the error estimate in [3] : 

Theorem 4 (DeVore Ron I). Let £ > 2k — d+1. Suppose that p satisfies Slow Growth with e > 
There is a constant C so that for f G C^^(M'^) having compact support, 

|/(x)-rH/(x)|<Cp(x)2^||AV||oo 



Proof. This follows by applying the growth assumption to ([T]) and writing 7 = 1 — e to obtain: 

r / I _ I \ 7(2fe-rf) / \ 

\f{x)-T^f{x)\ < C IaVHIKx^'MI + ^T^ 1+ . |w da 

V V Pi-) J \ (1 + ^) / 

< Cp(x)2'=-iAV(a)||oc / 1 + da 

/•CO 

Jo 

The second inequality follows by writing (2k — d)'y — u[l — 7) = {2k — d)^ — {(. — 2k + d){\ — 7). 
The convergence of the last integral is a consequence of the assumption 7 < 1 — □ 



A further result from [3], is that functions of lower smoothness can also be treated with local error 
estimates. The operator is instrumental in obtaining low smoothness results, albeit indirectly. 
This is the point of [H Theorem 5.3], which, for completeness, we rephrase in a simplified form 
as Theorem [5] in terms of the slow growth assumption. The lower order result is technically more 
complicated than that of TheoremHl It is a common technique to use interpolation theory to obtain 
direct approximation results for functions of lower smoothness. DeVore and Ron use an argument 
of this type, that splits / into a rough but benign part, 6, and a smooth part, g. This is done in a 
way that is not entirely straightforward, by controlling the size of h (and the smoothness of g) in a 
precise way to match the local density. 

To discuss lower smoothness estimates, we first introduce fractional smoothness spaces. These 
can be expressed in numerous different ways: as Besov spaces, Triebel-Lizorkin spaces or (more 
familiarly) Holder- Zygmund spaces. In the setting we consider, these are the same spaces. That is, 
we consider ^^(M^) = B'^ 00^'^) ~ C'^^'^iW^). However, the exact smoothness norm we employ 
is the -B^ ^ norm, defined in terms of wavelet coefficients. Smooth functions can be expanded as 
/ = YlJLo'^j'^j ^^d the smoothness seminorm is expressed in terms of coefficients cj. 

A totally standard construction, used also in [3], indexes wavelets by gendered, dyadic cubes: i' € T), 
where each v = {ey,Iy) is a pair comprising: 

• a gender e = 6,^ € 0, l'' \ {0} 

• and a dyadic cube I := ly = 2~^{k + [0, l]'^). 



5 



For a general dyadic cube of this form we denote the corner by c(I) := l^^k and the sidelength 
by £(/) := 2~K These definitions extend for gendered cubes: c{v) := c{Iu) and l{v) := 
Under this indexing, each gendered cube v has exactly one parent v' , where ly C = Cyi and 

The wavelet system we employ is a family of C, compactly supported functions, with r > 2k. Each 
wavelet is related to one of 2*^ — 1 prototypes by affine changes of variable: ipu{x) = ^e,, • 

other words, each wavelet is a translated, rescaled copy of one of the 2*^ — 1 functions \I'e € C^(M'^). 
Consequently the supports of wavelets are obtained by affine transformations, and each is contained 
in a ball with radius proportional to the sidelength and centered at the corner of the cube /(z^). 
I.e., there is F > so that for all € 2) 

:= sn^vi^y) = c{v)+l{v) x suppC^-gJ d B {c{v) ,T ^{v)) . 

For orthogonal wavelet systems, compactly supported continuous functions have the unique expan- 
sion / = YlueD fvi^i^- The smoothness seminorm of / is 

Theorem 5 (DeVore Ron II). Let £ > 2k — d + l. Suppose that p satisfies Slow Growth with e > 
There is C > so that for all compactly supported f G ^, a < 2k, there is sj^= G span((/),H) 
so that 

\f{x)-Sf,s\<Cpixr\\f\\BS^^^. 



Proof. We split f = g + b where |6(x)| < p{xY\f\Bz^^^ and \^^g{x)\ < p{xY ^^\f\Bg^^^- A con- 
sequence of this and Theorem[l]is that \f{x) — T'^g{x)\ < p{xY\\f\\Bg^ ^ and the theorem follows 
with s = T^g. 

To obtain the split, partition T) = Dg U D5 by selecting cubes ly according to the density over 1,^: 

V ^T>g iff £{u) > p{v) := maxp(?/). 

Define g := X^i/eDg fi^i^y — Yli(u)>p(v) fi^i^i^ Estimating the iterated Laplacian of a wavelet is 
straightforward: for x G I^,^ \/^^il)y{x)\ < C£{u)~'^^. Consequently, 

iA^(x)! <cY. fA^r'' <c\f\Bs^^^ ^(^y' ^ c'\f\Bs^ jpix)r-''. 

The final estimate deserves some explanation. Note that v G Vg and x G imply that > p{x). 
Finding j G Z so that 2^ > p{x) > 2^~^, the number of wavelets with £{1^) = 2^ that have x in their 
support is bounded, #{z/ : £{u) = 2^ ,x £ Ij^} < A^, with a constant independent of x and j. We 
may rewrite the sum in the next to last expression in the chain of inequalities as 

00 00 

^ £iur-^'^ ^ E E (2^'+T"'' < iV2^'(--'') E (^T"'' < c{pix)y-^\ 
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Estimating the size of b{x) := J2,yeDb write \b{x)\ < Ylu&Tii, x&l^ l-^^l' '^^i'^'^ bounded by 

l/lsg,^^ Ei^e^b.^G/, {^{^)Y- If ^ and y are in 7,, and if p{y) > £{u) then 

p{x)>Cs,nP{y)(^l+^-^^^ >£{l^)Csm{l + 2rr' i{v)<Cp{x) 

As in the case of g, it fohows that the series \h{x)\ < \f\B'^ ^Ylii{u)<Cp{x) xeiS^^^^)" '^^'^ 
rewritten as a sum of geometric series, to obtain \b{x)\ < C \f\Bs^ ^ {p{^))'^ ■ ^ 



3 Discussion 



We now turn to a discussion of how condition ([SD may be directly implemented to produce effective 
global approximants with local error estimates. 

Remark 1. It is possible to generate a set of centers with a fixed spacing, having a more refined 
spacing on a particular subset. This can be done in such a way that the pointwise error from surface 
spline approximation reflects the local arrangement of centers. 

In this example the global spacing is h = 2^^ , while the spacing near the origin will be /i^ = 2~'^K 
We choose to impose a slow growth condition with e = 1/3. By Theorem |4] we observe that the 
LD must have precision £ > 2/c/e, so we choose £ = 7k. The spacing of centers immediately around 
the origin should be 2"^-' , but there is an intermediate region where the spacing grows slowly. We 
decompose this in j regions of increasing width: 

nj:={x:\x\<7kx2^-'^^}\ \J ^i, J = 1, . . . , j 

0<£<J 

in which we place gridded centers with spacing hj = 2'^~^~^-^. (The presence of the factor 7k is 
only to ensure that an adequate unisolvent set is captured in each fij. Since the radii of these sets 
are shrinking more slowly than the spacing of the centers, this factor can be omitted for sufficiently 
large values of j.) For this distribution, and for points x in the transition region Uj=i have 
p{x) = p(0)(l + j^)^/^. (It is not hard to show that p{x) < p{y){l + ^Y^^ holds throughout 
W^). Thus, by Theorem [5] there is a constant C so that for any / with smoothness < s < 2A; we 
have a good approximant with dramatically increased accuracy at the origin: 

\\f -SfU<Ch'\\f\\s and \f{^)-Sf{Q)\<Ch^'\\f\l 

In fact, for x E Oj, one has \ f{x) — sj(x)| < C2'*("^^^-'^ The cardinality of centers in the ball 
5(0, 7k X 2-^1"^) is less than 

^(7A:)'^2'^(¥-2j)2^'^(-^-2,) < (7fc)d^2'^(^) < Q(7A:)'^2^(^). 
j=o j=o 

Since the same ball filled with centers having uniform spacing h = 2~^ holds roug hly (7A:)'^2'^(j/2) 
centers, the increased precision comes at a cost of adding only a multiple of the original centers. 
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Figure 1: A configuration of centers for thin plate spline approximation (k = 2, as in the first 
remark) where the density ranges from p(0) ^ = 2"^ to the coarsest density is p{x) ^ h = 2~^. 



We note that this example could easily be modified to place higher density centers on any subset 
X (curves, hypersurfaces, etc.). To do so, one produces rings fij around X and fills each ring with 

gridded centers having density 2^~'^^ as before. 

Remark 2. In the previous remark, the spacing of centers was varied over an intermediate region 
having radius roug hly /iV2. The size of this region is related to the choice of e in the slow growth 
condition, and ultimately to the degree of polynomial precision for the local polynomial reproduc- 
tion. For a similar set up, where a density of roughly is desired in a subset, we note that the 
radius of the transition region is determined by the inequality 
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which suggests that se should be less than 1 in order for the transition region to stay manageable 
(i.e., in order that it shrinks with h). In turn, this indicates that a polynomials of a sufficiently 
high order should be reproduced. Thus, for given s, e < 1/s and the degree of polynomial precision 
£ > 2k/e > 2ks. 

Remark 3. It should be noted that the "power function" method for estimating error from RBF 
interpolation, [4, Chapter 11], also provides local error estimates for RBF interpolation. The 
estimate on the power function ^ Theorem 11.9] can easily be rephrased in terms of a LD p of 
sufficient precision. For surface spline interpolation, the requirement is precision k — 1. 

Remarkably, no extra global compatibility assumption such as Definition [2] are needed for this. The 
pointwise estimate [H Theorem 11.16] can be stated as 

|/(x)-s/ = (x)!<Cp(x)'=-'=^/VIh/|(r^) 

for / G W^{R'^). 

The drawback of this result is that the target function must reside in the space W^iW^), rather 
than permitting error estimates for the entire scale of smooth functions C^'^(M'^) or B^^(R'^), 
a < 2k as in Theorem U and Theorem [5l 
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